SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(1) APPLICANT: 

(A) NAME: CLOUGH et al ■ - - 

(B) STREET: National Institute for Medical Research, Mill 

Hill 

(C) CITY: London 

(E) COUNTRY: United Kingdom 

(F) POSTAL CODE (ZIP): NW7 lAA 

(ii) TITLE OF INVENTION: AN EF-TU PROTEIN ENCODED ON THE PLASTID DNA 
OF THE MALARIA PARASITE AND PROTEIN SYNTHESIS INHIBITORS 
EFFECTIVE AS ANTI -MALARIAL COMPOUNDS 

(iii) NUMBER OF SEQUENCES: 2 

(iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC -DOS/MS -DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 

(V) CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US NA 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1230 base pairs 

(B) TYPE: nuclGic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 
10 (A) NAME/KEY: CDS 

(B) LOCATION:!.. 1230 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

15 

ATG AAT AAT AAA TTA TTT TTA AGA AAT AAA CAA CAT ATA AAT 1TA GGT 48 
Met Asn Asn Lys Leu Phe Leu Arg Asn Lys Gin His He Asn Leu Gly 
15 10 15 

20 ACT ATA GGG CAT GTA GAT CAT GGA AAA ACT ACA TTA ACA ACA GCT ATA 96 
Thr He Gly His Val Asp His Gly Lys Thr Thr Leu Thr Thr Ala He 
20 25 30 

TCT TAT TTA TTA AAT TTA CAA GGA TTA TCA AAA AAA TAT AAT TAT TCA 144 
25 Ser Tyr Leu Leu Asn Leu Gin Gly Leu Ser Lys Lys Tyr Asn Tyr Ser 
35 40 45 

GAT ATT GAT TCA GCT CCA GAA GAA AAA ATA AGA GGT ATT ACA ATA AAT 192 
Asp He Asp Ser Ala Pro Glu Glu Lys He Arg Gly He Thr He Asn 
30 50 55 ' 60 

ACA ACA CAT ATT GAA TAT GAA ACT TTA ACA AAA CAT TGT GCT CAT ATA 240 
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Thr Thr His He Glu Tyr Glu Thr Leu Thr Lys His Cys Ala His He 
65 70 75 80 



GAT TGT CCA GGA CAT TCC GAT TAT ATT AAA AAT ATG AH ATA GGA GCC 
Asp Cys Pro Gly His Ser Asp Tyr He Lys Asn Met He He Gly Ala 
85 90 95 



288 



ACA CAA ATG GAT ATA GCA ATT TTA GTA ATA TCT ATA ATA GAT GGT ATA 
Thr Gin Met Asp He Ala He Leu Val He Ser He He Asp Gly He 
100 105 110 

ATG CCT CAA ACT TAT 6AA CAT TTA TTA TTA ATA AAA CAA ATA GGT ATA 
Met Pro Gin Thr Tyr Glu His Leu Leu Leu He Lys Gin He Gly He 
115 120 125 



AAA AAT ATA ATT ATT TTT TTA AAT AAA GAA GAT TTA TGT GAT GAT GTT 
Lys Asn He He He Phe Leu Asn Lys Glu Asp Leu Cys Asp Asp Val 
130 135 140 



432 



GAA TTA ATA GAT TTT ATA AAA TTA GAA GTA AAT GAA TTA TTA ATT AAA 
Glu Leu He Asp Phe He Lys Leu Glu Val Asn Glu Leu Leu He Lys 
145 150 155 160 



TAT AAT TTT GAT TTA AAT TAT ATA CAT ATA TTA ACT GGT TCA GCA TTA 
Tyr Asn Phe Asp Leu Asn Tyr He His He Leu Thr Gly Ser Ala Leu 
165 170 175 



528 



AAT GTA ATA AAT ATA ATT CAA AAA AAT AAG GAT TAT GAA TTA ATA AAA 
Asn Val He Asn He He Gin Lys Asn Lys Asp Tyr Glu Leu He Lys 
180 185 190 



576 



TCT AAT ATT TGG ATA CAA AAA TTA AAT AAT TTA ATT CAA ATA ATT GAT 
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624 



Ser Asn He Trp He Gin Lys Leu Asn Asn Leu He Gin He lie Asp 
195 200 205 



AAT ATT ATA ATA CCT ACT AGA AAA ATT AAT GAT TAG TTT TTA ATG TCA 
5 Asn He He He Pro Thr Arg Lys He Asn Asp Tyr Phe Leu Met Ser 
210 215 220 

ATA GAA GAT GTA TTT TCT ATA ACA GGT AGA GGT ACA GTA GTA AGA GGT 
He Glu Asp Val Phe Ser He Thr Gly Arg Gly Thr Val Val Thr Gly 
10 225 230 235 240 



AAG ATT GAA CAA GGA TGT ATA AAT TTA AAT GAT GAA AH GAA ATT TTA 
Lys He Glu Gin Gly Cys He Asn Leu Asn Asp Glu He Glu He Leu 
245 250 255 



768 



AAA TTT GAA AAA TCA TCT CCT AAT HA ACA ACA GTT ATA GGA TTA GAA 
Lys Phe Glu Lys Ser Ser Pro Asn Leu Thr Thr Val He Gly Leu Glu 
260 265 270 



816 



ATG TTT AAA AAA CAA TTA ACA CAA GCA CAA TCC GGA GAT AAT GTA GGT 
Met Phe Lys Lys Gin Leu Thr Gin Ala Gin Ser Gly Asp Asn Val Gly 
275 280 285 



864 



ATT TTA TTA AGA AAT ATT CAA AAA AAA GAT ATA AAA AGA GGT ATG ATT 
25 He Leu Leu Arg Asn He Gin Lys Lys Asp He Lys Arg Gly Met He 
290 295 300 



912 



TTA GCA ACA CCT AAT AAA TTA AAA GTA TAT AAG TCT TTT ATA GCT GAA 
Leu Ala Thr Pro Asn Lys Leu Lys Val Tyr Lys Ser Phe He Ala Glu 
30 305 310 315 320 



960 



ACA TAT ATT TTA ACT AAA GAA GAA GGT GGT CGT CAT AAA CCT TTT AAT 

-36- 



1008 



Thr Tyr He Leu Thr Lys Glu Glu Gly Gly Arg His Lys Pro Phe Asn 
" 325 330 335 



ATT GGA TAT AAA CCT CAA TH TTT ATT CGT ACA GTA GAT GTT ACT G6A 
He Gly Tyr Lys Pro Gin Phe Phe He Arg Thr Val Asp Val Thr Gly 
340 345 350 



1056 



GAA ATT AAA AAT ATA TAT TTA AAT GAA AAT GTA CAA AAA GTA GCT ATA 
Glu He Lys Asn lie Tyr Leu Asn Glu Asn Val Gin Lys Val Ala He 
10 355 360 365 



1104 



CCT GGA GAT AAA ATA ACA TTA CAT ATT GAA TTA AAA CAT TAT ATA GTG 
Pro Gly Asp Lys He Thr Leu His He Glu Leu Lys His Tyr He Val 
370 375 380 



1152 



TTG ACA TTA AAT ATG AAA TTT TCT ATT AGA GAA GGA GGA AAA ACA ATA 
Leu Thr Leu Asn Met Lys Phe Ser He Arg Glu Gly Gly Lys Thr He 
385 390 395 400 



1200 



20 GGA GCA GGT ATT ATA ACA GAA ATA AAA AAT 
Gly Ala Gly He He Thr Glu He Lys Asn 
405 410 



25 (2) INFORMATION FOR SEQ ID NO: 2: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 amino acids 

(B) TYPE: amino acid 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 



Met Asn Asn Lys Leu Phe Leu Arg Asn Lys Gin His He Asn Leu Gly 
15 10 15 

Thr He Gly His Val Asp His Gly Lys Thr Thr Leu Thr Thr Ala He 
20 25 30 

Ser Tyr Leu Leu Asn Leu Gin Gly Leu Ser Lys Lys Tyr Asn Tyr Ser 
35 40 45 

Asp He Asp Ser Ala Pro Glu Glu Lys He Arg Gly He Thr He Asn 
50 55 60 

Thr Thr His He Glu Tyr Glu Thr Leu Thr Lys His Cys Ala His He 
65 70 75 80 

Asp Cys Pro Gly His Ser Asp Tyr He Lys Asn Met He He Gly Ala 
85 90 95 

Thr Gin Met Asp He Ala He Leu Val He Ser He He Asp Gly He 
100 105 110 

Met Pro Gin Thr Tyr Glu His Leu Leu Leu He Lys Gin He Gly He 
115 120 125 

Lys Asn He He He Phe Leu Asn Lys Glu Asp Leu Cys Asp Asp Val 
130 135 140 

Glu Leu He Asp Phe He Lys Leu Glu Val Asn Glu Leu Leu He Lys 
145 150 155 160 
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Tyr Asn Phe Asp Leu Asn Tyr He His He Leu Thr Gly Ser Ala Leu 
165 170 175 



Asn Val He Asn He He Gin Lys Asn Lys Asp Tyr Glu Leu He Lys 
180 185 190 

Ser Asn He Trp He Gin Lys Leu Asn Asn Leu He Gin He He Asp 
195 200 205 

Asn He He He Pro Thr Arg Lys He Asn Asp Tyr Phe Leu Met Ser 
210 215 220 

He Glu Asp Val Phe Ser He Thr Gly Arg Gly Thr Val Val Thr Gly 
225 230 235 240 

Lys He Glu Gin Gly Cys He Asn Leu Asn Asp Glu He Glu He Leu 
245 250 255 

Lys Phe Glu Lys Ser Ser Pro Asn Leu Thr Thr Val He Gly Leu Glu 
260 265 270 

Met Phe Lys Lys Gin Leu Thr Gin Ala Gin Ser Gly Asp Asn Val Gly 
275 280 285 

He Leu Leu Arg Asn He Gin Lys Lys Asp He Lys Arg Gly Met He 
290 295 300 

Leu Ala Thr Pro Asn Lys Leu Lys Val Tyr Lys Ser Phe He Ala Glu 
305 310 315 320 

Thr Tyr He Leu Thr Lys Glu Glu Gly Gly Arg His Lys Pro Phe Asn 
325 330 335 
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He Gly Tyr Lys Pro Gin Phe Phe He Arg Thr Val Asp Val Thr Gly 
340 345 350 



61 u He Lys Asn He Tyr Leu Asn 
5 355 360 

Pro Gly Asp Lys He Thr Leu His 
370 375 

10 Leu Thr Leu Asn Met Lys Phe Ser 
385 390 

Gly Ala Gly He He Thr Glu He 
405 



Glu Asn Val Gin Lys Val Ala He 
365 

He Glu Leu Lys His Tyr He Val 
380 

He Arg Glu Gly Gly Lys Thr He 
395 400 

Lys Asn 
410 
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